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^ (54) Title: HEPATITIS B CORE ANTIGEN FUSION PROTEINS 

QO (57) Abstract: The hepatitis B virus (HBV) capsid is made up of a single species of protein called the core antigen (HBcAg) which 
self-assembles into particles. The panicles are highly immunogenic and are able to present heterologous epitopes to the immune 
system when the epitopes are inserted into a surface-exposed region of the particles called the "el loop". The structural building 
1^ blocks of the particles are lightly associated dimers of HBcAg in which the adjacent el loops are closely juxtaposed. It is proposed 
"^■^ that sequences inserted into the el loop are conformationally restrained in the assembled particles when presented in monomelic 
core protein. The invention seeks to solve this problem by covalently linking core proteins as tandem copies, e.g., as dimers, so 
that insertions can be made independently in each copy. This is particularly useful for insertion of large sequences into the el loop 
because it allows such sequences to be inserted into just one copy of the core protein per tandem repeat, thereby reducing potential 
conformational clashes in assembly. Alternatively, a diflerenl sequence may be inserted into each el loop of a tandem repeat, thus 
increasing the flexibility of HBcAg particles as an epitope delivery system. 
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PTRP ATfTTR B CORE ANTIGEN FUSION PROTEINS 

The invention relates to hepatitis B core antigen fusion proteins^ particles containing the 
proteins, nucleic acid molecules encoding the proteins, processes for producing the 
5 proteins, pharmaceutical compositions containing the proteins and iise of the proteins in 
prophylactic and therapeutic vaccination. 

Background to the invention 

10 Hepatitis B is a major healthcare problem throughout both the developed and developing 
world. Infection with the hepatitis B virus (HBV) can result in an acute or chronic disease 
which in a proportion of cases may lead to hepatocellular carcinoma and death. The virus 
is double shelled, and its DNA is protected inside a protein structure called the core 
antigen (HBcAg). The core is surrounded by the envelope protein known as tilie surface or 

15 S antigen (HBsAg). HBc^ is an unusual antigen which can be used as a deliv«y vehicle 
for specific peptides to the immune system. The antigen has been used to present 
T-helper, B and cytotoxic lymphocyte (CTL) epitopes firom a variety of viral and bacterial 
pathogens, including epitopes from the surface antigen of HBV, envelope proteins firom 
hepatitis A virus and antigens fi-om hepatitis C virus. For a review see Ulrich et al (1998) 

20 Advances in Virus Research 50 141-182. 

HBcAg is an excellent vehicle for the presentation.of epitopes due to the molecular 
structure of the protein, which self-assembles into particles. Each particle is generated 
&om either 1 80 or 240 copies of a monomeric polypeptide. The monomer, on reaching an 
25 appropriate concentration inside the host cell, forms a particle of approximately 27 nm in 
diameter. Structural studies have shown that amino acids within the region firom residues 
68 to 90 form a spiked structure on the surface of the particle which is known as the el 
loop. Two monomers joined by disulphide bonds link to form a dimer spike, the most 
exposed amino acid being at position 80 (at the centre of the el loop). 

30 
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* 

EP-A-421635 (The Wellcome Foundation Limited) describes modification of the HBV 
core gene to allow insertion of foreign epitopes into the el loop without altering the 
potential of the protein to fix)m particles. Insertion at this site allows maximum exposure 
of the inserted epitope on the tip of each spike created by dimers of the protein. As there 
5 are approximjately 180 or 240 copies of each monomer per particle, each particle is able to 
present 1 80 or 240 copies of the epitope of int^^ 

Sniiimflr y of the invention 

10 In the dimers of HBcAg \^4iich form the structural building blocks of core particles, 

adjacent el loops are closely juxtaposed. It is proposed that sequences inserted into the el 
loop are conformationally restrained in the assembled particles when presented in 
monomeric core protein. The invention seeks to solve this problem by covalentiy linking 
core proteins as tandem copies, e.g. as dimers, so that insertions can be made 

IS independently in each copy. Thisisparticularly useful for insertion of large sequences 
into the el loop because it ailov^ such sequences to be inserted into just one copy of the 
core protein per tandem repeat, thereby reducing potential conformational clashes in 
assembly. Alternatively, a different sequence may be inserted into each el loop of a 
tandem repeat, thus increasing the flexibility of HBcAg particles as an epitope 

20 presentation system. 

Thus, the invention provides a protein comprising tandem copies of HBcAg. The protein 
is generally a dimer comprising two copies of HBcAg. A heterologous epitope may be 
inserted into the el loop of one or more of the copies of HBcAg. The protein assembles 
25 into particles which present the heterologous epitope inserted in the el loop on their 
surfjEices and are use&l in the prophylactic and therapeutic vaccination of humans and 
animals. 



30 
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Detailed description of the inventinn 
The protein 

5 

The basic building block of the protein of the invention is HBcAg, v^ch has 1 83 or 1 85 
amino acids (aa) depending on the subtype of HBV. The sequence of the 183 ammo acid 
protein of the ayw subQT)e plus a 29 amino acid pre-sequ«ice is shown in SEQ ID No. 2. 
The mature HBcAg runs jBx>m the Met residue at position 30 to the Cys residue at the 
10 extreme C-tenninus, with the sequence fix>m positions 1 to 29 being a pre-sequence. 

The protein generally comprises only two copies of HBcAg forming a dimer because 
dimers of HBcAg form the structural building blocks of core particles. However, the 
protein may comprise further copies of HBcAg. Thus, the protein may comprise from 2 to 
15 8 copies or from 2 to 4 copies of HBcAg. The use ofmore than two copies increases the 
flexibility of the system; for example, the use of three copies allows three different 
epitopes to be inserted into three el loops in tiie protem of the invention and thereby 
increases the breadth of tiie immune response induced by the protein of the invention. 

20 The HBcAg units are generally joined together in a head-to-toe fashion, i.e. the C- 
temiinus of one imit is joined to the N-terminus of the adjacent unit. The imits may be 
joined directly by a covalent bond (e.g. a peptide bond), but preferably they are joined by a 
linker which spaces the adjacent units apart and thereby prevents any problem with 
dismption of the packing of adjacent units. The nature of the linker is discussed below. 

25 

One or more of the HBcAg units in the protein of the invention may be native full length 
HBcAg. However, generally at least one of the units is a modified form of HBcAg, for 
example HBcAg modified by insertion of a heterologous q)itope in the el loop. In dimers 
according to the invention, one of the HBcAg units may be modified and the othw may be 
30 native HBcAg. 
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As a general rule, any modifications are chosen so as not to interfere with the 
conformation of HBcAg and its ability to assemble into particles. Such modifications are 
made at sites in the protein which are not important for maintenance of its conformation, 
for example in the el loop, the C-temoinns and/or the N-tenninus, Tbe el loop of HBcAg 
S can tolerate insertions of e.g. from 1 to 120 amino acids without destroying the particle* 
fonning ability of the protein. 

.The HBcAg sequence may be modified by a substitution, insertion, deletion or extension. 
The size of insertion, deletion or extension may, for example, be from 1 to 200 aa, fix)m 3 

10 to 100 aa or from 6 to 50 aa. Substitutions may involve a number of amino acids up to, for 
example, 1, 2, 5, 10, 20 or 50 amino acids over the length of the HBcAg sequence. An 
extension may be at the N- or C-terminus of HBcAg. A deletion may be at the N- 
terminus, C-terminus or at an internal site of the protein. Substitutions may be made at 
any position in the protein sequence. Insertions may also be made at any point in the 

1 5 protein sequence, but are ^pically made in sur£ace-e3q)osed regions of the protein such as 
the el loop. An inserted sequence may carry a heterologous epitope. More than one 
modification may be made to each HBcAg unit. Thus, it is possible to make a terminal 
extension or deletion and also an internal insertion. For example, a truncation may be 
made it the C-temiinus and an insertion may be made in the el loop. 

20 

Substitutions will generally be conservative and may be made, for example, according to 
the following Table, in which amino acids in the same block in the second colmim and 
preferably in the same line in tiie third column may be substituted for each other. 
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Each part of the HBcAg sequence in the protein of the invention preferably has at least 
70% sequence identity to the corresponding sequence of a natural HBcAg protein^ such as 
the protein having the sequence shown in SBQ ID NO: 2. Morfe prefCTably, the identity is 
5 at least 80%, at least 90%, at least 98%, at least 97% or at least $9%. Methods of 
measuring protein sequence (and nucleic acid Sequence) identity are well known in the art 
For example, the UWGCG Package provides the BESTFTT programme (Devereux et al 
(1984) Nucleic Acids Research 12, p.387-395). Similarly, the PILEUP and BLAST 
algorithms can be used to line up sequences (for example as described in Altschul S. F. 
10 (1993) J. Mol Evol: 36:290-300 and Altschul, S, F. et al (1990) J. Mol Biol 215:403-10). 

The el loop of HBcAg is at positions 68 to 90, and a heterologous epitope may be inserted 
anywhere between these positions. Preferably, the epitopcis inserted in the region fiom 
positions 69 to 90, 7 1 to 90 or 75 to 85. Most preferred is to msert the epitope between 

15 ammo acid residues 79 and 80 or between residues 80 and 81 . When a heterologous 
epitope is inserted, the entire sequence of HBcAg may be maintained, or alternatively the 
whole or a part of the el loop sequence may be deleted and replaced by the heterologous 
sequence. Thus, amino acid residues 69 to 90, 71 to 90 or 75 to 85 may be replaced by a 
heterologous epitope. Where a heterologous epitope replaces el loop sequence, the 

20 epitope is generally not shorter than the sequence that it replaces. 

A C-terminal truncatipn of HBcAg will generally not go beyond aa 144 because if any 
further truncation is made particles may not form. Thus, the deleted amino acids may, for 
example, comprise aa 144 to the C-terminal aa (aa 183 or 185), aa 150 to the C-terminal 
25 aa, aa 1 64 to the C-terminal aa or aa 1 72 to the C-terminal aa. The C-terminus of HBcAg 
binds DNA, and truncation of the C-tCTminus therefore reduces or completely removes 
DNA from preparations of HBcAg and HBcAg hybrid proteins. 

The protein of the invention forms particles \^ch preferably resemble the particles 
30 formed by native HBcAg. The particles of the invention are typically at least 10 nm in 
diameter, for example from 1 0 to 50 nm or from 20 to 40 nm in diameter, but preferably 
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they are about 27 um in diameter (which is the size of native HBcAg particles). They 
conq)rise multiple HBcAg units, for example from 150 to 300 units^ but generally they are 
fixed to about 1 80 or about 240 imits (which are the numbers of units in native HBcAg 
particles). 'Where the protein of the invention is a dimer, this means that the number of 
5 protein monomers in the particles may be from 75 to 150 but is generally about 90 or 
about 120. 

The linker between adjacent HBcAg units is generally a chain of anodno acids at least 
1.5 mn (15 A) in length, for example from L5 to 10 mn, from 1.5 to 5 nm or from 1.5 to 3 

10 nm. It may, for example, comprise 4 to 40 aa or 10 to 30 aa, preferably 15 to 21 aa. The 
linker is generally flexible. The anmio acids in the linker may, for example, include or be 
entirely composed of glycine, serine and/or proline. A preferred linker comprises one or 
more repeats of the sequence GlyGlySer (GrGS). Alternatively, the linker may comprise 
one or more GlyPro (GP) dipeptide repeats. The number of repeats may, for example, be 

15 from 1 to 18, preferably from 3 to 12. In the case of GGS repeats, the use of 5, 6 or 7 
repeats has been found to allow the formation of particles. The linker may correspond to 
the hinge region of an antibody; this hinge region is thought to provide a flexible joint 
between the antigen-binding and tail domains of antibodies. 

20 As iadicated above, a heterologous epitope may be inserted into one or more of the copies 
of HBcAg in the protein of the invention, preferably into the el loop. A "heterologous" 
epitope is an epitope tiiat is not normally located at the position at which it is located in the 
HBcAg; it is generally from a protein other than HBcAg but it may be from a different 
location in HBcAg. The epitope comprises a sequence of amino acids which raises an 

25 iirmiune response. The epitope may be conformational or linear. It may be, for example, 
in a sequence of from 6 to 120aa,from6to50aaorfrom6to20aa. A major advantage 
of the invention is that it allows epitopes carried on large sequences to be inserted into the 
el loop, for ^cample on sequences of from 30 to 120 aa, 40 to 120 aa or 60 to 120 aa. 

30 The protein of the invention may contain more than one heterologous epitope, for example 
up to 2, 3, 5 or 8 heterologous epitopes, and in this case the epitopes may be present in the 
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same or different HBcAg units. More than one copy of an qpitope may be inserted in each 
HBcAg unit; for exainple, from 2 to 8 copies may be inserted. Where there are two or 
more heterologous epitopes in the protein of the invention, they may be from the same or 
different organisnis and from the same or diflere^^ 
5 * 

The epitope may be a T-cell or a B-cell epitope. If it is a T-cell epitope, it may be a 
cytotoxic T-lymphocyte (CTL) epitope or a T-helper (Th) ceU epitope (e.g. a Thl or Th2 
epitope). In a preferred embodiment of the invention, one of the epitopes is a T-help^ cell 
epitope and another is a B-cell or a CTL epitope. The presence of the T-helper cell 
1 0 epitope enhances the immirne response against the B-cell or CTL epitope. 

The choice of epitope depends on the disease that it is wished to vaccinate against. The 
epitope may, for example, be from a pathogenic organism, a cancer-associated antigen or 
an allergen. The patiiogenic organism may, for example, be a virus, a bacterium or a 
15 protozoan. 

Examples of pathogens whose epitopes may be inserted include hepatitis A virus (HAV), 
HBV, HCV, influenza virus, foot-and-mouth disease virus, poliovirus, hetpes simplex 
virus, rabies virus, feline leukemia virus, human immunodeficiency virus type 1 (HIVl), 

20 human immunodeficiency virus type 2 (HIV2), simian immunodeficiency virus (SIV), 
human rhinovirus, dengue virus, yellow fever virus, htmian papilloma virus, respiratory 
syncytial virus, Plasmoditm falciparum (a cause of naalaria), and bacteria such as 
Afycobacteria, Bordetella, Salmonella, Escherichia, Vibrio, Haemophilus, Neisseria, 
Yersinia and Brucella. Specifically, the bacterium may be Afycobacterium tuberculosis - 

25 the cause of tuberculosis; Bordetella pertussis or Bordetella parapertussis - causes of 
whoopmg cough; Salmonella typhimurium - the cause of sahnonellosis m several animal 
species; Salmonella typhi - the cause of human typhoid; Salmonella enteritidis - a cause of 
food poisoning in humans; Salmonella choieraesms - a cause of sahnonellosis in pigs; 
Salmonella dublin - a cause of both a systemic and diarrhoeal disease in cattle, especially 

30 in new-bom calves; Escherichia coli - a cause of food poisoning in humans; Haemophilus 
influenzae - a cause of meningitis; Neisseria gonorrhoeae - a cause of gonnorrhoeae; 
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Yersinia enterocolitica - the cause of a spectrum of diseases in humans ranging fix>m 
gastroenteritis to fatal septicemic disease; and Brucella abortus - a cause of abortion and 
infertility in cattle and a condition kaowQ as undulant fever in humans. 

5 Examples of candidate epitopes for lise in the invention include epitopes firom the 
following antigCTs: the HIV aintigens gp 120, gp 160, gag, pol, NeJ^ Tat and Ref; the 
malaria antigens CS protein and Sporozoite sur&ce protein 2; the influenza antigens HA, 
NP and NA; the herpes virus antigens EBV gp340, EBV gp85, HSV gB, HSV gD, HSV 
gH, HSV early protein product, cytomegalovirus gB, cytomegalovirus and IE protein 

1 0 gP72; the human papilloma virus antigens E4, E6 and E7; the respiratory syncytial virus 
antigens F protein, G protein, and N protein; the pertactin antigen of B.pertussis; the 
tumor antigens carcinoma CEA, carcinoma associated mucin, carcinoma P53, melanoma 
MPG, melanoma P97, MAGE antigen, carcinoma Neu oncogene product, prostate specific 
antigen (PSA), prostate associated antigen, ras protein, and myc; and house dust mite 

15 allergen. 

Especially preferred epitopes are those from the pre-S 1 region, the pre-S2 region, tiie S 
. region or core antig^ of HBV. It is possible to insert the whole of the pre-S 1 and/or the 
whole of Ae pre-S2 region into HBcAg, but generally only a part of one of the r^ons is 

20 inserted. The inserted part is typically at least 6 amino acids in length, for example from 6 
to 120 aa, 20 to 80 aa or 20 to 50 aa. The insert may include, for example, the residues at 
pre-Sl positions 1-9, 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99, 100- 
109 or 110-119 or the residues at pre-S2 positions 120-129, 130-139, 140-149, 150-159, 
160-169 or 170-174. Particularly preferred inserts are pre-Sl residues 20-47 and pre-S2 

25 residues 139-174. 

Making the proteins of the invention 

The proteins of the inv^tion are generally made by recombinant DNA technology. The 
30 invention includes a nucleic acid molecule (e.g. DNA or RNA) ^coding a protein of the 
invention, such as an egression vector. The nucleic acid molecules may be made using 
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Imowateclmiquesforiiiampxilati^ Typically, two separate DNA 

constructs encoding two HBcAg units are made and then joined together by overl^ping 
PGR. 

5 A protein of the invention may he produced by culturing a host Cell containing a nucleic 
molecule ^coding the protein und» conditions in wbich the protem is expressed^ and 
recovering the proteiiL Suitable host cells include bacteria such as E. coliy yeast» 
mammalian cells and other eukaryotic cells, for exanogple insect Sf9 cells. 

10 The vectors constituting nucleic acid molecules according to the invention may be, for 
example, plasmid or virus vectors. They may contain an origin of replication, a promoter 
for the expression of the sequence encoding the protein, a regulator of the promoter such 
as an enhancer, a transcription stop signal, a translation start signal and/or a translation 
stop signal. The vectors may also contain one or more selectable marker genes, for 

1 5 example an ampicillin resistance gene in the case of a bacterial plasmidl)r a neomycin 
resistance gene in the case of a mammalian vector. Vectors may be used in vitrOy for 
example for the production of RNA or used to transform or transfect a host cell. The 
vector may also be adapted to be used in vivo, for example in a mediod of gene ther^y or 
DNA vaccination. 

20 

Promoters, enhancers and other expression regulation signals n^y be selected to be 
compatible with the host cell for which the expression vector is designed. For example, 
prokaryotic prdnioters may be used, in particular those suitable for use in E. coli strains 
(such as E. coli HBlOl). A promoter whose activity is induced in response to a change in 

25 the surrounding environment, such as anaerobic conditions, may be used. Preferably an 
htrA or nirB promoter may be used. Tliese promoters may be used in particular to egress 
the protein in an attenuated bacterium, for example for use as a vaccine. When expression 
of the protein of the invention is carried out in mammalian cells, either in vitro or in vivo, 
m a mmali an promoters may be used* Tissue^specific promoters, for example hepatocyte 

30 cell-specific promoters, may also be used. Viral promoters may also be used, for example 
the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), the rous 
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sarcoma virus (RS V) LTR promoter, the S V40 promoter, the human cytomegalovirus 
(CMV) IE promote, herpes simplex virus promoters and adenovirus promoters. All these 
promoters are readily available in the art 

S A protein according to the invention icnay be punfied using conventional techniques for 
purifying protdns. The protein may, for ^cample, be provided in purified, pure or isolated 
fonn. For use in a vacdne, the protein must generally be provided at a hi^ level of 
purity, for example at a level at which it constitutes more than* 80%, more than 90%, more 
than 95% or more than 98% of the protein in the preparation. However, it may be 
1 0 desirable to mix the protein with other proteins in the final vaccine formulation. 

Vaccination against diseases 

The prinoiary use of the proteins of the invention is as thersqpeutic or prophylactic vaccines. 
1 S The invention iQcludes a pharmaceutical composition (e.g. a vaccine composition) 

comprising a protein of the invention, a particle of the invention or a nucleic acid molecule 
of the invention tod a phaimaceutically acceptable carrier or diluent 

The principle behind prophylactic vaccination is to induce an inamune response in a host 
20 so as to generate an immunological memory in the host. This means that, when the host is 
exposed to the virulent pathogen, it mounts an effective (protective) immune response, i.e* 
an immune response which inactivates and/or kills the pathogen. The invention could 
form the basis of a prophylactic vaccine against a range of diseases and conditions, such as 
HBV, HAV, HCV, influenza, foot-and-mouth disease, polio, herpes, rabies, AIDS, dengue 
25 fever, yellow fever, malaria, tuberculosis, whooping cough, typhoid, food poisoning, 
diarrhoea, meningitis and gonorrhoea. The epitopes in the protein of the invention are 
chosen so as to be appropriate for the disease against which the vaccine is intended to 
provide protection. 

30 The principle behind therapeutic vaccination is to stimulate the immune system of the host 
to alleviate or eradicate a disease or condition. There are a number of diseases and 
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conditions which may be suscq)tible to therapeutic vaccination, such as cluronic viral 
diseases including chronic HBV and chronic HCV, cancer, and aUe^^ 
atopy, eczema^ rhinitis and food allergies. 

5 Chronic viral diseases arise v^eri the mnh^ 
the virus, aUowing the vhiis to persist in the host for a long ^ Theinvention 
may be used to mduce the unmune system of the chronically infected individual so as to 
elmunate the vmis. For example, is believed that patients with chronic hepatitis have an 
madequate T-cell response, and that stimulation of an appropriate T-cell response can 
10 eliminate flie virus. Thus, in order to treat chronic viral hepatitis using the invention, T- 
cell epitopes may be mserted into the protein of the mvention, such as T-cell epitopes from 
the pre-Sl and pre-S2 regions of HBV. 

Sunilarly, in tibie case of cancer, it is believed that enhancement of the T-cell response to 
15 tumour antigens may help the immune system to destroy Ihe tumour. It is believed that 
allergic diseases are caused at least in part by an unbalanced T-cell response in vMch an 
i n flam m a t ory Th2 responses dominates over an antagonistic Thl response, and that 
aUergiesmaytiiereforebetreatedby enhancing the Thl response. This can be achieved 
according to the invention by usmg a protrin containing a heterologous epitope which 
20 stimulates a Thl response. 

Suitable carriers and diluents for inclusion in pharmaceutical compositions of the 
invention are isotonic saline solutions, for example phosphate-buffered saline. The 
composition will normally include an adjuvant, such as aluminium hydroxide. The 

25 composition may be formulated in liquid form for injection. The composition comprises 
the protein, particles or nucleic acid in a prophylactically or therapeutically effective 
amount. Typically, the protein or particles are administered m a dose of from 0.1 to 200 
f^,prefCTably from 1 to 100 Kig, more preferably from lOto 50 jig body weight. The 
nucleic acid of the invention may be administered directly as a naked nucleic acid 

30 construct using techniques known in the art or using vectors known in the art The amount 
of nucleic acid administered is typically in the range of from 1 jig to 10 mg, preferably 
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from 100 jLig to 1 mg. The vacx^ine may be given in a single dose schedule or a multiple 
dose schedule, for example in from 2 to 32 or from 4 to 1 6 doses. The routes of 
administration and doses given above are intended only as a guide, and the route and dose 
may ultiniately be at the discretion of the physicialL 

5 ' ' • • . 

Experimft wtal SgcAiim 

Brirf description of the drawings 

10 Figure 1: A hypothetical model showing die feasibility of a linked AB dimer of hepatitis 
B core. 

F^ure 2: A schematic representation of the cdnstruction of hetero- and homo-tandem 
cores. The bars represent the primary structures of the proteins. Within the assembly 
15 domain of HBcAg (amino acids 1-144), the el loop (black rectangle) and the regions 
involved in intradimer (li^t shading) and inteidimer (dsak shading) contacts are indicated. 
The Arg-rich nucleic acid binding domain is symbolised by +. Primers (Table 1) are 
indicated as arrows. 

20 Figure 3: A 12% SDS-PAGE of fractions from a sucrose density gradient separation of 
homo-tandem core particles. 

Figure 4: Electron micrograph of hetero-tandem core particles with a linker comprising 
five repeats of GGS. 

25 

Figure 5: A Western blot showing the efiScient expression of hetero-tandem cores in 
E.C0IL The cores contained 5, 6 and 7 GGS repeats as the linker respectively (GGS5, 
GGS6 and GGS7). 

30 Figure 6: The results of cryo-electromnicroscopy of tandem core particles. Figure 6(a) 
shows tandem core particle (the left-hand particle) in comparison with a native particle 
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(the middle particle), the right-hand part of Figure 6(a) shows the C-terminal part ofcoie 
antigen in a tandein core particle. Figure 6(b) shows the fitting of aportion of ttie 
structure of a tandem core particle with a native particle. 

S Methods 

Examination of the HBV core particle structure suggested that a flexible Imker of at least 
1 .5nm (15 A) could be used to Unk ^ two proteins in a dimer pahr wifliout disnqrting 
Aeir structural integrity (Figure I). Consequently, constructs were made by overlapping 

10 PCRs in vMch the upstream core protem was truncated to residue 149 and then linked to a 
downstream copy via 5, 6 or 7 copies of a GlyGlySer (GGS) repeat sequence (Figure 2). 
The downstream copy was either the full length core protein or was truncated at amino 
acid 149 to remove the Arg-rich C-terminal region. Table 1 gives the ohgonucleotide 
sequences used to construct the various HBV tandem cores. 

15 - ... 

The constructs were cloned into ptrc^Alra^res^on vectoiTtraiisfomied^^ JM 
109 and mduced with BPTG. Cells were then harvested by cMxtrifiigation, resuspended 
mto PBS and sonicated twice. Lysates cont^nmg soluble expcessed tandem cores were 
made 30% saturated ammonium sulphate and the predpitated pzotdns collected by 

20 centrifiigation, resuspended into PBS and dialysed against phosphate-buffered saline. The 
clarified lysate was loaded onto 15-45% linear sucrose gradients and centrifiiged at 28,000 
rpm for 4 hours at 4"C. Graidients were fiactionated from the bottom of the tube into 2 ml 
ahquots and analysed by SDS-PAGE and Western Blottmg usmg a monoclonal primary 
antibody against HBV core protein (mAb 13). 

25 . 

HBV core particle preparations were spotted onto carbon coated grids, negatively stained 
with uracyl acetate and visualized in transmission electron microscopy. The structures of 
the core particles were detramined using ayo-eiectionmiacoscopy. 
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Table 1: Sequences of the oligonucleotide primers used for cloning HBV tandem core 
genes into ptrc99A. 



5 


Primers 


Sequences (5'-»-3') 




1 


GTTACCATGGACATTGACCCTTA'P 




2 


GTCCATAGA(ACCACCAGA)5AACAACAGTAGnTCCX3G 




3 


GTCCATAGA(ACCACCAGA)«AACAACAGTAGTTTCCGG 




4 


GTCCATAGA(ACCACCAGA)7AACAACAGTAGTTTCCGG 


10 


5 


GTrGTr(GGTGGTTCT)5ATGGACATTGACCCTTAT 




6 


GTTGTT(GGTGGTTCT)6ATGGACATTGACCCTTAT 




7 


GTTGTT(GGTGGTTCT)7ATGGACATTGACCCTTAT 




8 


TATGAAGCnTATGAGTCCAAGGA'* 




9 


TATGAAGCTTCCGTCGTCAAACAA'' 



15 



* Ncol restriction site is boldfaced 
' ^ EBndm restriction site is bold&ced 

Results 

20 

Tandem HBV core proteins wifli 5, 6 or 7 copies of GGS were all expressed successfully 
and were shown to migrate in polyacrylamide gels wiHi the expected mobilities. Each 
assembled into core particles as evidenced by their sedimentation in sucrose density 
gradients (Figure 3) and their appearance in the electron microscope (Figure 4). ITie 
25 particles retained their antigenic properties as demonstrated by their reactivity in ELIS A 
and Western blots (Figure 5). Furthermore, the structures of the particles formed by the 
tandem core proteins were indistinguishable from the structure of native core particles in 
cryo-electronmicroscopy (Figure 6). 

30 
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CLABIS 

1 . A protein comprising tandem copies of hepatitis B core antig^ (HBcAg). 

5 . . ■ • ■ ; ' 

2. A protein according to claim 1 whichis adimer of two copies of HBcAg. 

3. A protein according to claim 1 or 2 therein one or more of the copies of HBcAg 
has a heterologoius epitope in tilie el loop. 

10 

4. A protein according to claim 3 wherein all the copies of HBcAg have a 
heterologous epitope in the el loop. 



5, A protein according to claim 4 wherem all the copies have the same heterologous 
15 ^ ' ' epitope in el "loop:^'^'^^^'^^ r '"^ ■ ' 



* 6. A protein according to claim 4 A^erein each copy has a different heterologous 
epitoi>e in the el loop. 

20 7. A protein according to any one of claims 3 to 6 wherein the or each heterologous 
epitope is from the pre-Sl or pre-S2 region of hepatitis B virus (HBV). 

8. A protein according to any one of claims 3 to 7 wherein the or each heterologous 
epitope is in a heterologous sequence of from 10 to 120 amino acid residues in the 

25 el loop- 

9. A protdui according to any one of the preceding claims wherein one or more of the 
copies of HBcAg is truncated at the C-terminus. 
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10. A protein according to any one of the preceding claims 'herein the tandem copies 
of HBcAg are joined by a linker. 

11. A protein according to claim 10\yherein the linker is at least l.Snminlengfh. 

5 . 

12. A protein according to claim 1 0 or 1 1 wherem the linker comprises multiple copies 
of the sequence GlyGlySer (GGS). 

13. A protein according to claim 12 ATvherein the linker comprises 5, 6 or 7 copies of 
10 the sequence GGS. 

14. A particle comprising multiple copies of a protein as claimed in any one of the 
preceding claims. 



15 15i A nucleic acid molecule encoding a protein as claimed in any one of claims 1 to 
13. 

16. A nucleic add molecule according to claim 1 5 which is an expression vector. 
20 17. A host cell comprising a nucleic acid molecule as claimed m claim 15 or 16. 



18. A process for producing a protem as claimed in any one of claims 1 to 13, which 
process comprises culturing a host cell containing a nucleic acid molecule which 
encodes the protein under conditions in which the protein is e?qpressed, and 

25 recovering the protein. 

19. A pharmaceutical composition comprisiog a protein as claimed in any one of 
claims 1 to 13, a particle as claimed in claim 14 or a nucleic acid molecule as 
claimed in claim 15 or 16 and a phaimaceutically acceptable carrier or diluent 

30 
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20. a protein according to any one of claims 1 to 13, a particle according to claim 14 

^ " - or a nucldc add molecule according to claim 15 or 16 for BSC in a method of 

. prophylactic or therqjeuticvacdimtion of thq him ' | 

21 . A protein, particle or nucldc acid molecule according to claim 20 for use in a I 

F-' • ■ — ' -• i 

5 method of prophylactic or therapeutic vaccination of the human or animal body - | 

against HBV. 



22. Useof a protem according to any one of claims 1 to 13, a particle according to ^ 

\ 

claim 14 or a nucleic acid molecule according to claim IS or 16 for the 

10 manufe c ture of a medicament for prophylactic or th^peutic vaccination of the f - 

.* 

himian or animal body agamst HBV. 

23 . A method of prophylactic or therapeutic vaccination of a subject, which method . 
^ comprises adminis tering to the subject a protein as claimed in any one of claims 1 

^I^mi«ssms^ssmss3^i 1 Smsmmsu to 1 3^ a particle as claimed in claim 14 or a nucleic acid molecule as claimed iif^^^^^^^ 

claimed 15 or 16. 
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Fig.3. 

Fractions 
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SEQUENCE LISTING 

'<no> to 

^r;r:r <120 HEPATITIS B CORE ANTIGEN FUSION PROTEINS 

<130> N79405 

;<160> 2 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 639 
<212> DNA 

<213> Hepatitis B virus 

<220> 
<221> CDS 

<222> (1)..(639) _ L 

<400> 1 

atg caa ctt ttt cac etc tgc eta ate ate tct tgt tea tgt cet act 48 
• Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr I 
1 5 10 15 I 



gtt caa gee tee aag ctg tgc ctt ggg tgg ctt tgg ggc atg gac ate 96 
Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 

20 . 25 30 4 

gac cet tat aaa gaa ttt gga get act gtg gag tta etc teg ttt ttg 144 | 
Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 1 
35 . 40 45 . ! 

cot tct gac ttc ttt cet tea gta cga gat ctt eta gat acc gcc tea 192 
Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

get ctg tat egg gaa gee tta gag tct cet gag eat tgt tea cet cac 240 I 
Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His I 
65 70 75 80 

cat act gea etc agg caa gea att ctt tgc tgg ggg gaa eta atg act 288 
His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
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eta gct.bcc tgg gtg ggt gtt aat ttg gaa gat cca gcg tct aga gac 336 t 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Prp Ala Ser Arg Asp I 
100 105 110 I 



eta gta gtc agt tat gtc aac act aat atg ggc eta aag ttc agg caa 384 | 

Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 1 

115 120 125 i 

etc ttg tgg ttt cae att tct tgt etc act ttt gga aga gaa aca gtt 432 f 

Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val ? 

130 135 140 I 



ata gag tat ttg gtg tct ttc gga gtg tgg att cgc act cct cca get 480 • 
He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

tat aga cca cca aat gee cct ate eta tea aca ctt ccg gag act act 528 i 
- Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr I 

' ' i 

gtt gtt aga cga ega ggc agg tec cct aga aga aga act ccc teg cct 576 
Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

cgc aga cga agg tct eaa teg ceg egt cgc aga aga tct caa tct egg 624 
Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 

gaa tct caa tgt tag' 539 
Glu Ser Gin Cys 
210 

<210> 2 
<211> 212 
<212> PRT 

<213> Hepatitis B virus 
<400> 2 

Met Gin Leu Phe His Leu Cys Leu He He Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp He 
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20 



25 



30 



Asp Pro Tyr Lys Glu^Wie^Gly Ala Thr Val Glu Leu Leu Ser Phe Leu 



Pro Ser Asp Phe Phe Pro Ser Va1 Arg Asp Leu Leu Asp Thr A1a Ser 

50 55- - ; 60— - 

A1a Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 

65 . 70 _ : 75 80 

His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr 
85 90 ' 95 

Leu Ala Thr Trp Val Gly Val Asn Leu Glu Asp Pro Ala Ser Arg Asp 
100 105 110 



Leu Val Val Ser Tyr Val Asn Thr Asn Met Gly Leu Lys Phe Arg Gin 
115 120 125 



Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 



130 



140" 



He Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 . 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 ' 200 205 



Glu Ser Gin Cys 
210 
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